1.  Introduction


Let $E$ be a vector space of dimension $2p$ over the field $\mathbb{R}$ of real numbers. Let $x_1,\dots,x_N$ ($N \ge 2p$) be independent, identically distributed observations from a normal distribution with mean value 0 and unknown covariance $\Sigma$. That is, $\Sigma$ is a positive definite form on the dual space $E^*$ to $E$. The maximum likelihood estimator $\hat\Sigma$ for $\Sigma$ is well known to be given by

$$\hat\Sigma(x_1,\dots,x_N) = \Big((x^*,y^*) \mapsto \frac{1}{N}\sum_{i=1}^{N} x^*(x_i)\,y^*(x_i)\ ;\ x^*,y^*\in E^*\Big).$$


The distribution of $\hat\Sigma$ is the Wishart distribution on the set $\mathcal{P}(E^*)_r$ of positive definite forms on $E^*$ with $N$ degrees of freedom and parameter $\frac{1}{N}\Sigma$. Suppose now that $E$ is also a vector space over the field $\mathbb{C}$ of complex numbers such that the restriction to the subfield of real numbers in $\mathbb{C}$ is the original vector space structure on $E$. The dimension of $E$ as a vector space over $\mathbb{C}$ is then $p$. The vector space $E^*$ is then also a vector space over the complex numbers under the definition $zx^* = x^*\circ z = (x\mapsto x^*(zx)\ ;\ x\in E)$, $x^*\in E^*$, $z\in\mathbb{C}$. The set

$$\mathcal{P}_{\mathbb{C}}(E^*)_r = \{\Sigma\in\mathcal{P}(E^*)_r \mid \Sigma(zx^*,y^*) = \Sigma(x^*,\bar z y^*),\ \forall x^*,y^*\in E^*,\ \forall z\in\mathbb{C}\}$$

defines a null hypothesis in the statistical model described above. The condition $\Sigma(zx^*,y^*) = \Sigma(x^*,\bar z y^*)$, $\forall x^*,y^*\in E^*$, $\forall z\in\mathbb{C}$, is called the $\mathbb{C}$-property in Andersson [2], and in terms of matrices it has the following formulation: for every basis $e_1^*,\dots,e_p^*$ for the complex vector space $E^*$, the matrix for a $\Sigma$ with the $\mathbb{C}$-property with respect to the basis $e_1^*,\dots,e_p^*, ie_1^*,\dots,ie_p^*$ for the real vector space $E^*$ has the form

(1.1)
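A small numerical sketch may make the matrix formulation concrete. It assumes, as an illustration that is not taken from the text, that multiplication by $i$ acts on coordinates $(\alpha,\beta)$ with respect to a basis of the form $e_1^*,\dots,e_p^*, ie_1^*,\dots,ie_p^*$ as $(\alpha,\beta)\mapsto(-\beta,\alpha)$, with matrix `J`. Under that assumption the $\mathbb{C}$-property of a real symmetric matrix `S` reads `J.T @ S @ J == S`, which forces equal diagonal blocks and a skew-symmetric off-diagonal block. All helper names are hypothetical.

```python
import numpy as np

def complex_structure(p):
    """Real 2p x 2p matrix J of 'multiplication by i' on coordinates (alpha, beta)."""
    I, O = np.eye(p), np.zeros((p, p))
    return np.block([[O, -I], [I, O]])

def average_over_i(S):
    """Average S over J so that the result satisfies J' S J = S (assumed C-property)."""
    J = complex_structure(S.shape[0] // 2)
    return 0.5 * (S + J.T @ S @ J)

def has_c_block_form(S, tol=1e-10):
    """Check S = [[S11, S12], [-S12, S11]] with S12 skew-symmetric."""
    p = S.shape[0] // 2
    S11, S12, S21, S22 = S[:p, :p], S[:p, p:], S[p:, :p], S[p:, p:]
    return bool(np.allclose(S11, S22, atol=tol)
                and np.allclose(S12, -S12.T, atol=tol)
                and np.allclose(S21, -S12, atol=tol))

rng = np.random.default_rng(0)
p = 3
G = rng.standard_normal((2 * p, 2 * p))
S = G @ G.T + np.eye(2 * p)   # a generic positive definite matrix
S_c = average_over_i(S)       # positive definite and invariant under J
print(has_c_block_form(S), has_c_block_form(S_c))
```

Averaging over `J` is only a convenient way to produce an example in the null hypothesis; it is not a construction used in the paper.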


The statistical problem of testing $\Sigma\in\mathcal{P}_{\mathbb{C}}(E^*)_r$ versus $\Sigma\in\mathcal{P}(E^*)_r$ is invariant under the action of the group $GL_{\mathbb{C}}(E)$ of complex one-to-one linear mappings of $E$ onto itself on the sample and parameter space $\mathcal{P}(E^*)_r$ given by

(1.2)  $GL_{\mathbb{C}}(E)\times\mathcal{P}(E^*)_r \to \mathcal{P}(E^*)_r,\qquad (f,\Sigma)\mapsto \Sigma\circ(f^*\times f^*)$,

where $f^*$ is the dual mapping to $f\in GL_{\mathbb{C}}(E)$. The restriction of the action to the subset $\mathcal{P}_{\mathbb{C}}(E^*)_r$ is transitive. Since all tests invariant under (1.2) have a factorization through a maximal invariant function, we shall find a representation of a maximal invariant function into $\mathbb{R}^p$, describe the distribution as a density with respect to a restriction of the Lebesgue measure, and state an interpretation of this representation. The matrix for a complex linear mapping of $E$ with respect to a basis of the form $e_1,\dots,e_p, ie_1,\dots,ie_p$, where $e_1,\dots,e_p$ is a basis for the complex vector space $E$, is of the form

(1.3)  $\begin{pmatrix} A & B \\ -B & A \end{pmatrix}$.

The expression $\Sigma\circ(f^*\times f^*)$ from (1.2) in matrix formulation becomes

(1.4)

with respect to the dual basis $e_1^*,\dots,e_p^*, ie_1^*,\dots,ie_p^*$ in $E^*$ to $e_1,\dots,e_p, ie_1,\dots,ie_p$.
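The block form (1.3) can be explored numerically. The sketch below assumes the identification $Z = A + iB$ for the complex $p\times p$ matrix behind (1.3) and uses a hypothetical helper `real_rep`. It checks three properties that follow from the block structure: the real matrix commutes with the matrix `J` of multiplication by $i$ (taken here as $(\alpha,\beta)\mapsto(-\beta,\alpha)$, an assumption), products of complex matrices correspond to products of their real matrices, and the real determinant is $|\det Z|^2$.

```python
import numpy as np

def real_rep(Z):
    """Real 2p x 2p matrix [[A, B], [-B, A]] for Z = A + iB (assumed identification)."""
    A, B = Z.real, Z.imag
    return np.block([[A, B], [-B, A]])

rng = np.random.default_rng(1)
p = 3
Z1 = rng.standard_normal((p, p)) + 1j * rng.standard_normal((p, p))
Z2 = rng.standard_normal((p, p)) + 1j * rng.standard_normal((p, p))

I, O = np.eye(p), np.zeros((p, p))
J = np.block([[O, -I], [I, O]])   # assumed matrix of multiplication by i

M1, M2 = real_rep(Z1), real_rep(Z2)
commutes = np.allclose(M1 @ J, J @ M1)                     # complex linearity
multiplicative = np.allclose(real_rep(Z1 @ Z2), M1 @ M2)   # group homomorphism
det_ok = np.isclose(np.linalg.det(M1), abs(np.linalg.det(Z1)) ** 2)
print(commutes, multiplicative, det_ok)
```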
2.  Representation of the maximal invariant


2.1.  Lemma. Let $\Pi$ be a positive definite form on the $\mathbb{R}$-space $E$. Then there exists a basis $e_1,\dots,e_p$ for the $\mathbb{C}$-space $E$ such that the $2p\times 2p$ real matrix for $\Pi$ with respect to $e_1,\dots,e_p, ie_1,\dots,ie_p$ has the form

(2.1)  $\begin{pmatrix} I & D_\lambda \\ D_\lambda & I \end{pmatrix}$,

where $I$ is the $p\times p$ identity matrix and

(2.2)  $D_\lambda = \operatorname{diag}(\lambda_i)$ with $1 > \lambda_1 \ge \dots \ge \lambda_p \ge 0$.

Furthermore, the matrix $D_\lambda$ is uniquely determined by $\Pi$; and if $\lambda_1 > \lambda_2 > \dots > \lambda_p > 0$, then $\Pi$ also determines the basis $e_1,\dots,e_p$ uniquely up to the sign of each basis vector.


Proof: Let $e_1',\dots,e_p'$ be a basis for the $\mathbb{C}$-space $E$ and let

$$\Pi = \begin{pmatrix} \Pi_{11} & \Pi_{12} \\ \Pi_{12}' & \Pi_{22} \end{pmatrix}$$

be the $2p\times 2p$ real matrix for $\Pi$ with respect to $e_1',\dots,e_p', ie_1',\dots,ie_p'$. The assertion is then that there exists a nonsingular complex $p\times p$ matrix $Z = A + iB$ such that

(2.3)

and that $D_\lambda$ is unique; and in the case where $\lambda_1 > \dots > \lambda_p > 0$, the columns of $Z$ are unique up to multiplication by $\pm 1$.

The equation (2.3) is equivalent to the complex matrix equations


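(2.4)  $\bar Z'\big(\tfrac12(\Pi_{11}+\Pi_{22}) + i\,\tfrac12(\Pi_{12}-\Pi_{12}')\big)Z = I$,

       $Z'\big(\tfrac12(\Pi_{12}+\Pi_{12}') + i\,\tfrac12(\Pi_{11}-\Pi_{22})\big)Z = D_\lambda$.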


If  we  define  Z = 


and 


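(2.5)  $\Phi = \tfrac12(\Pi_{11}+\Pi_{22}) + i\,\tfrac12(\Pi_{12}-\Pi_{12}')$,

       $\Psi = \tfrac12(\Pi_{12}+\Pi_{12}') + i\,\tfrac12(\Pi_{11}-\Pi_{22})$,

then (2.4) becomes

(2.6)  $\Phi = \bar Z'Z$,

       $\Psi = Z'D_\lambda Z$.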


Since $\Phi$ and $\Psi$ are the matrices for a positive definite hermitian form and a symmetric form, respectively, on the $\mathbb{C}$-space $E$, it follows from [3] that we can find a complex $p\times p$ diagonal matrix $D$ and a complex nonsingular $p\times p$ matrix $Y$ such that

(2.7)  $\Phi = \bar Y'Y$,

       $\Psi = Y'DY$.

By permutation we can arrange that the diagonal elements $d_1,\dots,d_p$ of $D$ satisfy $|d_1| \ge \dots \ge |d_p|$. If we then multiply the $\nu$'th row of $Y$ by $\exp[i\theta_\nu/2]$, where $d_\nu = |d_\nu|\exp[i\theta_\nu]$, $\nu = 1,\dots,p$, and call the new matrix $Z$, we obtain (2.6) with $\lambda_\nu = |d_\nu|$, $\nu = 1,\dots,p$. Since $\Pi$ is positive definite, we have $1 > \lambda_1 \ge \dots \ge \lambda_p \ge 0$. The uniqueness follows from a rather elementary examination of the proof in [3] or from direct matrix calculation.

Since every matrix of the form (2.1) with $1 > \lambda_1 \ge \dots \ge \lambda_p \ge 0$ is positive definite, it follows from Lemma 2.1 that the mapping from $\mathcal{P}(E^*)_r$ onto the set $\{(\lambda_1,\dots,\lambda_p)\in\mathbb{R}^p \mid 1 > \lambda_1 \ge \dots \ge \lambda_p \ge 0\}$ determined from Lemma 2.1 is a maximal invariant function.
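The maximal invariant can be computed numerically along the lines of the proof. The following is a minimal sketch under stated assumptions, not the argument of [3]: the test matrix is generated as $M\,\Pi_0\,M'$ with $M$ the real block matrix $\begin{pmatrix} A & B\\ -B & A\end{pmatrix}$ of $Z = A + iB$ and $\Pi_0 = \begin{pmatrix} I & D_\lambda\\ D_\lambda & I\end{pmatrix}$. With that ordering one gets $\Phi = Z\bar Z'$ and $\Psi = Z D_\lambda Z'$, which may differ by a transpose or conjugation from the paper's own convention. The $\lambda_\nu$ are then read off as the singular values of $L^{-1}\Psi L'^{-1}$, where $\Phi = L\bar L'$ is a Cholesky factorization; this SVD step is a swapped-in numerical technique. All function names are hypothetical.

```python
import numpy as np

def real_rep(Z):
    """Real block matrix [[A, B], [-B, A]] of Z = A + iB (assumed convention)."""
    A, B = Z.real, Z.imag
    return np.block([[A, B], [-B, A]])

def canonical_form(lam):
    """Matrix [[I, D], [D, I]] with D = diag(lam)."""
    D, I = np.diag(lam), np.eye(len(lam))
    return np.block([[I, D], [D, I]])

def phi_psi(Pi):
    """Hermitian part Phi and complex symmetric part Psi built from the blocks of Pi."""
    p = Pi.shape[0] // 2
    P11, P12, P22 = Pi[:p, :p], Pi[:p, p:], Pi[p:, p:]
    Phi = 0.5 * (P11 + P22) + 0.5j * (P12 - P12.T)
    Psi = 0.5 * (P12 + P12.T) + 0.5j * (P11 - P22)
    return Phi, Psi

def complex_canonical_correlations(Pi):
    """Singular values of L^{-1} Psi L^{-T}, where Phi = L L^H."""
    Phi, Psi = phi_psi(Pi)
    L = np.linalg.cholesky(Phi)        # lower triangular, Phi = L L^H
    X = np.linalg.solve(L, Psi)        # L^{-1} Psi
    S = np.linalg.solve(L, X.T).T      # L^{-1} Psi L^{-T}
    return np.linalg.svd(S, compute_uv=False)

rng = np.random.default_rng(2)
lam = np.array([0.8, 0.5, 0.2])
Z = rng.standard_normal((3, 3)) + 1j * rng.standard_normal((3, 3))
Pi = real_rep(Z) @ canonical_form(lam) @ real_rep(Z).T
print(complex_canonical_correlations(Pi))
```

Because only singular values are used, the result does not change when `Pi` is transformed by a further matrix of the form `real_rep(W)`, which is the invariance the section describes.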

3.  Canonical  correlations  with  respect  to  a complex  structure. 
Interpretation. 

It follows from Lemma 2.1 that there exists a basis $e_1^*,\dots,e_p^*$ for the $\mathbb{C}$-space $E^*$ such that the $2p\times 2p$ matrix for $\Sigma$ with respect to $e_1^*,\dots,e_p^*, ie_1^*,\dots,ie_p^*$ has the form (2.1). In (2.1) $D_\lambda$ is unique; and if $\lambda_1 > \dots > \lambda_p > 0$, the basis $e_1^*,\dots,e_p^*$ for the $\mathbb{C}$-space $E^*$ is unique up to a sign for each element. $\lambda_j$ is called the $j$-th theoretical canonical correlation of $\Sigma$ with respect to the complex structure, and $e_j^*$ is called the $j$-th theoretical canonical linear form of $\Sigma$ with respect to the complex structure, $j = 1,\dots,p$. Let $x^*\in E^*$ have coordinates $(\alpha_1,\dots,\alpha_p,\beta_1,\dots,\beta_p)$ with respect to $e_1^*,\dots,e_p^*, ie_1^*,\dots,ie_p^*$. Then

(3.1)  $\Sigma(x^*,x^*) = \sum_i \alpha_i^2 + \sum_i \beta_i^2 + 2\sum_i \lambda_i\alpha_i\beta_i$

(3.2)  $\Sigma(ix^*,ix^*) = \sum_i \alpha_i^2 + \sum_i \beta_i^2 - 2\sum_i \lambda_i\alpha_i\beta_i$

(3.3)  $\Sigma(x^*,ix^*) = \sum_i \lambda_i(\alpha_i^2 - \beta_i^2)$


Consider the problem of maximizing $\Sigma(x^*,ix^*)$ under the conditions $\Sigma(x^*,x^*) = \Sigma(ix^*,ix^*) = 1$. This is equivalent to maximizing

(3.4)  $\sum_i \lambda_i(\alpha_i^2 - \beta_i^2)$

subject to the conditions

(3.5)  $\sum_i \alpha_i^2 + \sum_i \beta_i^2 = 1$ and $\sum_i \lambda_i\alpha_i\beta_i = 0$.

If we suppose that $\lambda_1 > \lambda_2 > \dots > \lambda_p > 0$, we get by using Lagrange's multipliers that the maximum is achieved at $\alpha_1 = \pm 1$, $\alpha_2 = \dots = \alpha_p = \beta_1 = \dots = \beta_p = 0$, and the maximum value is $\lambda_1$. By induction it follows that $\pm e_j^*$ are the only linear forms uncorrelated with $e_1^*,\dots,e_{j-1}^*$ for which $\Sigma(e_j^*,e_j^*) = \Sigma(ie_j^*,ie_j^*) = 1$ and $\Sigma(e_j^*,ie_j^*)$ is maximal. The maximum values are $\lambda_j$, $j = 1,\dots,p$.
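The quadratic expressions (3.1) to (3.3) and the first step of the maximization can be checked numerically. The sketch below assumes that the canonical matrix is $\begin{pmatrix} I & D_\lambda\\ D_\lambda & I\end{pmatrix}$ and that $ix^*$ has coordinates $(-\beta,\alpha)$ when $x^*$ has coordinates $(\alpha,\beta)$; both are readings consistent with the formulas above rather than statements quoted from the paper. The helper names and the sampling scheme for feasible points are hypothetical.

```python
import numpy as np

lam = np.array([0.9, 0.6, 0.3])
p = len(lam)
I, O, D = np.eye(p), np.zeros((p, p)), np.diag(lam)
Sigma = np.block([[I, D], [D, I]])   # assumed canonical matrix
J = np.block([[O, -I], [I, O]])      # assumed coordinates of i x*: (-beta, alpha)

def forms(x):
    """Return Sigma(x, x), Sigma(ix, ix), Sigma(x, ix) in coordinates."""
    ix = J @ x
    return x @ Sigma @ x, ix @ Sigma @ ix, x @ Sigma @ ix

rng = np.random.default_rng(3)
x = rng.standard_normal(2 * p)
a, b = x[:p], x[p:]
q_xx, q_ii, q_xi = forms(x)
f31 = a @ a + b @ b + 2 * np.sum(lam * a * b)   # right-hand side of (3.1)
f32 = a @ a + b @ b - 2 * np.sum(lam * a * b)   # right-hand side of (3.2)
f33 = np.sum(lam * (a ** 2 - b ** 2))           # right-hand side of (3.3)

def random_feasible(rng):
    """Random point satisfying both conditions in (3.5)."""
    a = rng.standard_normal(p)
    b = rng.standard_normal(p)
    w = lam * a
    b = b - (b @ w) / (w @ w) * w        # enforce sum(lam * a * b) = 0
    x = np.concatenate([a, b])
    return x / np.linalg.norm(x)         # enforce sum(a^2) + sum(b^2) = 1

best = max(forms(random_feasible(rng))[2] for _ in range(2000))
e1 = np.eye(2 * p)[0]                    # coordinates of e_1^*
print(best, forms(e1))
```

Random search only illustrates that sampled feasible points do not exceed the value attained at $e_1^*$; it does not replace the Lagrange multiplier argument.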

The canonical correlations $\lambda_1,\dots,\lambda_p$ with respect to the complex structure can be found as the positive roots of the equation

where

$$\Sigma = \begin{pmatrix} \Sigma_{11} & \Sigma_{12} \\ \Sigma_{12}' & \Sigma_{22} \end{pmatrix}$$

with respect to a basis of the form $f_1^*,\dots,f_p^*, if_1^*,\dots,if_p^*$.
4.  The distribution of the empirical canonical correlations with respect to a complex structure.

The estimator $\hat\Sigma(x_1,\dots,x_N)$ for $\Sigma$ at the observation point $(x_1,\dots,x_N)$ is given in the introduction. Suppose that $\Sigma\in\mathcal{P}_{\mathbb{C}}(E^*)_r$ and let $e_1^*,\dots,e_p^*$ be a basis for $E^*$ such that the $2p\times 2p$ matrix for $\Sigma$ with respect to the basis $e_1^*,\dots,e_p^*, ie_1^*,\dots,ie_p^*$ is the $2p\times 2p$ identity matrix. The distribution of $\hat\Sigma$ in terms of matrices is a Wishart distribution with a representation as a density with respect to the restriction of the Lebesgue measure to the set $\mathcal{P}(\mathbb{R}^{2p})_r$ of all positive definite $2p\times 2p$ matrices, as follows:

(4.1)  $c\cdot|\det\Theta|^{(N-2p-1)/2}\exp\{-\tfrac12\operatorname{tr}(\Theta)\}\,d\Theta,\qquad \Theta\in\mathcal{P}(\mathbb{R}^{2p})_r$.

The canonical correlations and linear forms (with respect to the complex structure) of $\hat\Sigma(x_1,\dots,x_N)$ are called the empirical canonical correlations and linear forms with respect to the complex structure. The classical theory of canonical correlations is due to Hotelling [4]. We shall find the distribution of the empirical canonical correlations. If we define $\Phi$ and $\Psi$ from the $2p\times 2p$ real matrix $\Theta$ as in formula (2.5), we have a one-to-one and onto mapping, with Jacobian 1, between $\mathcal{P}(\mathbb{R}^{2p})_r$ and $\mathcal{P}(\mathbb{C}^p)\times\mathcal{S}(\mathbb{C}^p)$, where $\mathcal{P}(\mathbb{C}^p)$ and $\mathcal{S}(\mathbb{C}^p)$ denote the sets of positive definite hermitian and of symmetric $p\times p$ complex matrices, respectively. Furthermore, (2.6) defines a one-to-one mapping from $GL_+(\mathbb{C}^p)\times\Omega$ into $\mathcal{P}(\mathbb{C}^p)\times\mathcal{S}(\mathbb{C}^p)$, where $GL_+(\mathbb{C}^p)$ is the subset of all nonsingular $p\times p$ complex matrices with a positive real part in the first row and $\Omega = \{(\lambda_1,\dots,\lambda_p)\in\mathbb{R}^p \mid 1 > \lambda_1 > \dots > \lambda_p > 0\}$.
The complement of the image of this mapping (the image is an open set) has Lebesgue measure 0, and it can therefore be ignored for the purpose of finding the distribution. To find the Jacobian of the mapping defined by (2.6), we proceed as in [1]. The method is due to Hsu [5]. Writing $\Lambda = D_\lambda$, we have

(4.2)  $d\Phi = (d\bar Z')Z + \bar Z'(dZ)$

       $d\Psi = (dZ')\Lambda Z + Z'(d\Lambda)Z + Z'\Lambda(dZ)$


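and we shall find the absolute value of the determinant of the linear mapping $(dZ,d\Lambda)\to(d\Phi,d\Psi)$ defined by (4.2). This is a composition of

(a)  $(dZ,\ d\Lambda)\ \to\ (dW,\ d\Lambda) = ((dZ)Z^{-1},\ d\Lambda)$

(b)  $(dW,\ d\Lambda)\ \to\ (d\bar W' + dW,\ dW'\Lambda + d\Lambda + \Lambda\,dW)$

(c)  $(d\Phi,\ d\Psi)\ \to\ (\bar Z'\,d\Phi\,Z,\ Z'\,d\Psi\,Z)$

The Jacobians are $|\det Z|^{-2p}$ and $|\det Z|^{2(2p+1)}$ for (a) and (c), respectively, and $c_1\prod_{i}\lambda_i\prod_{i<j}(\lambda_i^2-\lambda_j^2)$ for (b). Since $\operatorname{tr}(\Theta) = 2\operatorname{tr}(\Phi) = 2\operatorname{tr}(\bar Z'Z)$ and $|\det\Theta| = |\det\bar Z'Z|^2\prod_{i=1}^{p}(1-\lambda_i^2)$, (4.1) is transformed to the distribution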


(4.3)  $c_2\cdot|\det\bar Z'Z|^{N-p}\exp\{-\operatorname{tr}(\bar Z'Z)\}\prod_{i=1}^{p}\lambda_i(1-\lambda_i^2)^{(N-2p-1)/2}\prod_{i<j}(\lambda_i^2-\lambda_j^2)\,dZ\otimes d\Lambda$

on $GL_+(\mathbb{C}^p)\times\Omega$. Integrating over $Z\in GL_+(\mathbb{C}^p)$, we get the distribution of $f_1 = \lambda_1^2,\dots,f_p = \lambda_p^2$:

(4.4)  $c_3\prod_{i=1}^{p}(1-f_i)^{(N-2p-1)/2}\prod_{i<j}(f_i-f_j)\,df_1\cdots df_p$

on $\Omega = \{(f_1,\dots,f_p)\in\mathbb{R}^p \mid 1 > f_1 > \dots > f_p > 0\}$. Formula (13) in [1], p. 324, for $p_1 = p-1$ and $p_2 = p$ gives the norming constant, namely


(4.5)  $\displaystyle\prod_{i=1}^{p-1}\frac{\Gamma(\tfrac12(N-i))}{\Gamma(\tfrac12(N-p-i))\,\Gamma(\tfrac12(p-i))\,\Gamma(\tfrac12(p+1-i))}$
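The two identities used in transforming (4.1), $\operatorname{tr}(\Theta) = 2\operatorname{tr}(\bar Z'Z)$ and $|\det\Theta| = |\det\bar Z'Z|^2\prod_i(1-\lambda_i^2)$, can be verified numerically. The sketch below assumes that $\Theta$ is generated as $M\,\Pi_0\,M'$ with $M = \begin{pmatrix} A & B\\ -B & A\end{pmatrix}$ for $Z = A + iB$ and $\Pi_0 = \begin{pmatrix} I & D_\lambda\\ D_\lambda & I\end{pmatrix}$; this parametrization is an assumption chosen for the illustration and may differ from the paper's by a transpose. Both identities are insensitive to that choice, since they involve only $\operatorname{tr}(\bar Z'Z)$ and $|\det Z|$.

```python
import numpy as np

rng = np.random.default_rng(4)
lam = np.array([0.7, 0.4])
p = len(lam)

# Random nonsingular complex matrix Z = A + iB and its assumed real block matrix.
Z = rng.standard_normal((p, p)) + 1j * rng.standard_normal((p, p))
A, B = Z.real, Z.imag
M = np.block([[A, B], [-B, A]])

# Theta built from the canonical matrix [[I, D], [D, I]].
D, I = np.diag(lam), np.eye(p)
Theta = M @ np.block([[I, D], [D, I]]) @ M.T

gram = Z.conj().T @ Z                       # the hermitian matrix Z-bar' Z
trace_lhs = np.trace(Theta)
trace_rhs = 2 * np.trace(gram).real
det_lhs = np.linalg.det(Theta)
det_rhs = abs(np.linalg.det(gram)) ** 2 * np.prod(1 - lam ** 2)
print(trace_lhs, trace_rhs, det_lhs, det_rhs)
```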